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ABSTRACT 


Implicit Stereotypes: 
Evidence from Teachers’ Gender Bias* 





| study whether exposure to teachers’ stereotypes, as measured by the Gender-Science 
Implicit Association Test, affects student achievement. | provide evidence that the gender 
gap in math performance substantially increases when students are assigned to teachers 
with stronger gender stereotypes. Teachers’ stereotypes induce girls to underperform in 
math and self-select into less demanding high-schools, following the track recommendation 
of their teachers. These effects are at least partially driven by a lower self-confidence on 
own math ability of girls exposed to gender biased teachers. The findings are consistent 
with the hypothesis that stereotypes impair the test performance of ability-stigmatized 
groups, who end up failing to achieve their full potential. 
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NON-TECHNICAL SUMMARY 


Over the last century, the narrowing of gender differences in education and labor market 





outcomes has been impressive, up to a reversal of the gap in school attainment in many 
OECD countries. Despite that, gender stereotypical beliefs are pervasive and deeply-held 
in most societies. Women are believed to be worst than men in highly profitable fields 
as mathematics, engineering and technology, even controlling for measured ability. 
Stereotypes are overgeneralized and amplified representations of differences among 
groups, which are often based on empirical realities. Indeed, for instance, boys outperform 
girls in math by the age of 15 in most countries according with PISA data (Program for 
International Student Assessment). However, gender differences in math vary substantially 
across countries and increase dramatically throughout the educational career of students. 
Gaining a better understanding of the reasons behind the emergence of gap in math skills 
between males and females is of first-order importance to explain the enduring differences 
in performance and the underrepresentation of women in leadership position and among 
science, technology, engineering, and math (STEM) workforce. To the extent that gender 
stereotypes are internalized directly in the development of self-concept or influence 
investment choices, these cultural beliefs may have causal influence on life-outcomes of 
individuals, shaping educational and occupational careers. 


In this paper, | explore whether exposure to stereotypes can causally affect math 
achievements and track choice of boys and girls. | focus in particular on the influence of 
teachers’ gender stereotypes in affecting student performance, combining administrative 
data and original first-hand questionnaire on students and more than 1.400 teachers in 
Italy. | find that gender gap in math performance increases when students are assigned 
to teachers with higher bias (as measured using a computer- based tool developed in 
social psychology and called Implicit Association Test). The difference in the additional gap 
in math performance between boys and girls generated during middle school would be 
34 percent smaller if teachers had one standard deviation lower implicit stereotypes. The 
effect is driven by students from disadvantaged backgrounds and by lower performance of 
females, while males are not affected by implicit stereotypes. Teacher bias has a substantial 
impact on own assessment of math ability, as measured by detailed information collected 
through an original student questionnaire. This paper shows that biased teachers activate 
negative self-stereotypes on female students only in male typed domains (as math), while 
there is no effect on reading performance. Furthermore, | also provide evidence that 
teacher implicit bias is correlated with their high-school recommendation to students 
and it has an influence on the actual high-school track choice of pupils. The findings are 
consistent with a model of stereotype whereby ability-stigmatized groups underperform 
failing to achieve their potential. Teacher bias fosters low expectations about own math 
ability and underperformance of individuals vulnerable to the gender stereotype. 


1 Introduction 


Over the last century, the narrowing of gender differences in labor market participation and 
educational outcomes has been impressive, up to a reversal of the gap in school attainment in 
many contexts (Goldin et al., 2006). In spite of this, boys outperform girls in math in most 
countries and the gender gap in favour of boys is even wider among the highest-achieving 
students (OECD, 2014). A long-standing debate attributes gender differences in mathematics 
to either biologically based explanations in brain functioning or culture and social conditioning 
(Baron-Cohen, 2003; Ceci et al., 2009; Nollenberger et al., 2016). Cross-countries evidence 
support the latter idea by showing that the more gender equality a country has, the smaller its 
gender gap in math (Guiso et al., 2008; Nosek et al., 2009; Else-Quest et al., 2010).! Gaining 
a better understanding of whether stereotypes affect the emergence of the gap in math skills 
is important to potentially explain the enduring gender differences in readiness for science, 
technology, engineering, and math (STEM) universities and the underrepresentation of women 
in highly profitable occupations and fields (Altonji and Blank, 1999; Card and Payne, 2017). 

The gender stereotypical belief that women are worse than men in scientific fields is perva- 
sive and deeply-held in most societies and by most individuals, including parents and teachers 
(Tiedemann, 2000; Reuben et al., 2014; Bordalo et al., 2016). In this paper, I study whether 
exposure to gender stereotypes of teachers, as measured by the Gender-Science Implicit Associ- 
ation Test, affects educational achievement of boys and girls. Using a unique dataset including 
detailed first-hand surveys and administrative information, I find that teachers with stronger gen- 
der stereotypes have a negative and quantitatively significant influence on girls. First, I show 
that the gender gap in math improvements during middle school increases by 38 percent when 
students are assigned to teachers with one standard deviation higher implicit stereotypes. Said 
differently, the gender gap triples in classes assigned to a math teacher that implicitly associates 
more boys than girls with mathematics compared to classes assigned to a teacher that has the 
opposite implicit associations. Teachers’ stereotypes have no effect on boys’, while they lower 
girls’ math performance, especially of those from disadvantaged background. Second, I provide 
evidence that teachers’ stereotypes induce girls to self-select into less demanding tracks, follow- 
ing the biased recommendation of their teachers. Finally, I show that teachers’ stereotypes have 
a substantial negative impact on girls’ self-confidence in math. The findings are consistent 
with the hypothesis that stereotypes impair the test performance of ability-stigmatized groups, 
who end up failing to achieve their full potential as in a self-fulfilling prophecy (Rosenthal and 
Jacobson, 1968). 





'Ror instance, Nosek et al. (2009) exploits the Gender-Science Implicit Association Test as measure of stereo- 
types and finds that it predicts nation-level sex differences in 8th-grade science and mathematics achievement. 


One of the main challenges to study the impact of gender stereotypes on educational out- 
comes is the availability of a measurement of teachers’ stereotypes, matched with students’ 
achievements and choices. I measure stereotypes by administering the Gender-Science Implicit 
Association Test (IAT) to around 1,400 math and literature teachers, working in 103 schools 
in the North of Italy. This test is a computer-based tool developed by social psychologists 
(Greenwald et al., 1998) and recently used by economists when studying discrimination in the 
context of gender and race bias (Lowes et al., 2015; Burns et al., 2016; Glover et al., 2017). 
The test exploits the reaction time to associations among male or female names and scientific 
or humanistic fields. The underlying assumption is that responses are faster and more accurate 
when gender and field subjects are more closely associated by the individual (Lane et al., 2007). 
Although there is some mixed evidence on its predictive validity (Blanton et al., 2009; Oswald 
et al., 2013), implicit bias has been found to correlate with some outcomes in the real world and 
in laboratory experiments, related for instance to hiring decisions (Rooth, 2010; Reuben et al., 
2014). In addition to IAT scores, I collected detailed information on teacher characteristics, such 
as family background, teaching experience and explicit gender beliefs. To perform the analysis, 
I build a unique dataset, combining these surveys to teachers with administrative information 
on pupils from the Italian Ministry of Education and the National Institute for the Evaluation 
of the Italian Education System (INVALSD, together with a newly collected student question- 
naire. Data on pupils include performance in math and reading standardized test scores, family 
background, high-school track choice, teachers’ track recommendation and — for a sub-sample 
of students — also a measure of self-confidence in own abilities in different subjects. 

The identification strategy relies on the “as good as random” assignment of students to teach- 
ers with different level of implicit stereotypes. I provide supporting evidence showing that base- 
line characteristics of students, such as family background and initial standardized test scores, 
are not systematically correlated with teachers’ stereotypes. First, I investigate the impact of 
teachers’ stereotypes on the gender gap within the class. I add class fixed effects, which ab- 
sorb all characteristics of peers, school environment, and teachers, including the level of gender 
stereotypes. I exploit variations in performance and track choice between boys and girls en- 
rolled in the same class.2 Second, I compare students of the same gender, enrolled in the same 
school and cohort, but assigned to teachers with different level of stereotypes. This exercise 
permits to understand whether the wider gender gap in classes assigned to teacher with more 
stereotypes is due to girls lagging behind, boys improving more, or a combination of the two 


effects. 





*Students are assigned to the same group of peers from grade 6 to grade 8. Teachers are assigned to classes and 
follow students during all years of middle school, with few exception due, for instance, to retirement or transfer to 
a different school. 


This paper makes three contributions. First, I collect a dataset including IAT scores and ex- 
tensive additional information on around 1,400 teachers and I show that implicit associations 
correlates with observable characteristics, such as gender, field of study, and gender norms in 
the place of birth, as measured by the World Value Survey and by female labor force participa- 
tion at province level.* This empirical result supports the view that IAT scores reflect exposure 
to cultural stereotypes (Arkes and Tetlock, 2004). Furthermore, I find that IAT scores do not 
correlate with variables such as gender of own children, teacher quality, experience, and with 
self-reported gender bias, either because they relate to two different mental constructs or be- 
cause there is social desirability bias in the explicit answers (Greenwald et al., 2009). Second, 
the paper provides evidence on the relevance of cultural and social conditioning in affecting the 
gender gap in math achievement and high-school track choice. More precisely, it uncovers the 
role of implicit stereotypes in the context of education economics and pupil-teacher interactions 
that was not previously investigated. This result provides a link between teachers’ stereotypes 
and teacher bias*: it suggests that stronger male-math implicit associations of teachers interfere 
with their interaction with female students and their ability to be unbiased in the classroom, 
even unconsciously — for instance, when they recommend the high-school track to their stu- 
dents. Third, I show the influence of teachers on self-confidence on own math ability. This is a 
crucial channel to explain the underperformance of girls in math when assigned to more biased 
teachers, but also relevant per se since it suggests that the lower self-confidence of women in 
the scientific fields is at least partially activated by exposure to gender stereotypes. 

This study adds to the recent literature in economics that has underlined the benefits from 
interacting with social psychologists and considering implicit bias in studying discrimination 
(Guryan and Charles, 2013; Bertrand and Duflo, 2017). Implicit stereotypes can operate even 
without awareness or intention to harm the stigmatized-group (Nosek et al., 2002; Bertrand 
et al., 2005). In particular, we may expect that teachers do not explicitly endorse gender stereo- 
types, but their implicit stereotypes, embedded in their own experiences since childhood, affects 
their interaction with pupils. My work also contributes to the debate in the social psychological 
literature on what the IAT is measuring and on its predictive power of actual behaviour (Mc- 
Connell and Leibold, 2001; Blanton et al., 2009; Greenwald et al., 2009; Oswald et al., 2013). I 





3This has important implications for the estimation: when teachers controls or class fixed effects are not added 
in the regression, teachers’ stereotypes includes also characteristics correlated with IAT scores. Interestingly, as 
shown in Section 5, the results on the impact of teachers’ implicit stereotypes on the outcomes of interest are 
not affected by the inclusion of teachers’ controls or fixed effects neither in a statistically nor in an economically 
significant way. 

4Stereotypes are overgeneralized and simplified representation of differences between groups, which may hold 
a kernel-of-truth (Bordalo et al., 2017). Indeed, the belief that women are worse than men in math is based on a 
true empirical fact: girls lag behind in math test-scores in most countries by the age of 14. On the other hand, 
an individual is biased only if his or her own preconceived idea interferes with the ability of being impartial and 
objective. 


provide evidence that teachers with stronger implicit stereotypes negatively affect math achieve- 
ments of their female students. They are biased against girls in their track recommendation, with 
long-run implications for their educational career. 

Gender stereotypes are a channel through which teachers matter for performance in stan- 
dardized test scores and future outcomes (Chetty et al., 2014a,b). The economics literature 
analyzing the impact of gender stereotypes of teachers on student outcomes has mainly focused 
on either self-reported measures (Alan et al., 2018) or bias in grading, i.e. the gender differences 
in grades given in blind vs. open evaluations (Lavy and Sand, 2015; Terrier, 2015; Lavy and 
Megalokonomou, 2017).° Compared to other measures of teacher bias, the Implicit Association 
Test has two main advantages. First, it does not suffer from social desirability bias that may be 
an issue in self-reported measures. Second, teachers’ stereotypes are measured without relying 
on data on student performance, which may capture variation in unobservable characteristics of 
boys and girls, potentially correlated with future outcomes of pupils. Finally, a growing number 
of papers exploits the gender of teachers as proxy of exposure to stereotypes of their pupils 
(Bettinger and Long, 2005; Dee, 2005; Carrell et al., 2010; Antecol et al., 2014). In this paper, 
I provide evidence that the gender of teachers is correlated with Gender-Science IAT scores, 
but the impact of implicit stereotypes on student outcomes is similar in terms of magnitude for 
male and female teachers. 

Finally, I contribute to understanding the importance of gender-biased environments in ex- 
plaining the under-confidence of females in STEM fields. Gender differences in confidence and 
competitiveness have negative consequences for women’s performance, educational and occu- 
pational choices (Coffman, 2014; Reuben et al., 2015; Kugler et al., 2017). Exposure to biased 
teachers activates negative self-stereotypes on female students. The results are consistent with 
the predictions of the stereotype threat theory (Steele and Aronson, 1995), according to which 
individuals at risk of confirming widely-known negative stereotypes reduce their confidence 
and underperform in fields in which their group is ability-stigmatized (Spencer et al., 1999). 
Implicit stereotypes create a self-fulfilling prophecy, perpetuating gender differences in math 
performance. 

This paper is organized as follows. Section 2 explains the setting analyzed, providing infor- 
mation on the Italian institutional background. Section 3 describes the data available on both 
students and teachers. Section 4 presents the estimation and identification challenges. The 
main results of the paper are reported in Section 5 and mechanisms are discussed in Section 6. 


Finally, Section 7 concludes. All supplementary material is provided in the Online Appendices. 





SLavy and Megalokonomou (2017), using a panel dataset, show that gender bias in grading of teachers is 
persistent over time and it influences students’ university choice. 


2 Setting 


In the Italian educational system, middle school lasts three years from grade 6 to 8. Students 
are assigned to classes at the beginning of grade 6 and they stay with the same peers for three 
years.© The general class formation criteria are established by an Italian law and details are 
specified by each school council in a formal document available on the website of the institu- 
tion.’ The general criteria mentioned by most schools are equal allocation of students across 
classes according to gender, disability, socio-economic status and ability level (as reported by 
the elementary school). Moreover, I collect additional information directly from the principal 
on how classes are formed. School principals report that the most important aspect in the class 
formation process is the comparability across classes and heterogeneity within class in the same 
school (for detailed information, see Online Appendix B). What is important for my analysis 
is that I can also test whether this intention of the principals is confirmed by the allocation of 
students to classes in my sample (see section 4.3). 

Teachers are assigned to schools by the Italian Ministry of Education and their salary is 
determined by experience in a centralized system. Teachers’ allocation across school is settled 
by seniority: when they accumulate years of experience, they tend to move close to their home 
town and away from disadvantaged areas (Barbieri et al., 2011). Each class is assigned by the 
principal to a math and Italian teacher among those available in the school and they usually 
follow students from grade 6 to grade 8. Every week, students spend at least 6 hours with the 
math teacher and 5 hours with the Italian teacher.® 

Standardized test score in math and reading are administered in grade 2, 5, 6, 8 and 10 by the 
National Institute for the Evaluation of the Italian Education System (INVALSI).” The tests are 
presented to all students as ability tests, thus making the gender stereotype in math potentially 
relevant. They are graded anonymously following a precise evaluation grid and by a different 
teacher than the one instructing students in the specific subject. Students are not informed about 
their performance on the test, except for the one in grade 8. The achievement test score of grade 
8 is the highest stakes among these test scores, since until 2017 it affected 1/6 of the final score 





There are only few exceptions: students may be transferred to a different school by their parents or be required 
by their teachers to repeat a grade (overall less than 6% of students). 

7The D.P.R. 20 marzo 2009 n.81 establishes, for instance, that the number of students per class in middle school 
should be between 18 and 27. Further information at school level is provided on the “Plan of Education Offer” 
(“Piano dell’ Offerta Formativa”). An analysis of Ferrer-Esteban (2011) shows that ability grouping across classes 
within schools occurs almost exclusively in the South of Italy, while all schools in my sample are from the North. 

8Students can be enrolled in school from 30 to 43 hours per week and therefore the amount of time they spend 
with teachers vary. For instance, they spend from 6 to 9 hours with the math teacher. In some classes, Italian 
teachers also teach history and geography so they spend more time with students. The amount of hours per week 
spent with the Italian teacher therefore varies from 5 to 10. 

°The test score in grade 6 was administered only up to the school year 2012-13. 


of students at the end of middle school. However, this final grade has direct impact on the 
enrollment in high-school or on the future educational career of students. 

After middle school, students self-select into three different tracks: academic oriented (“‘liceo’’), 
technical, and vocational high-school. Each type of school is divided in several subtracks: the 
academic oriented track can be specialized in either scientific, humanistic, languages, human 
sciences, artistic or musical subjects, the technical track can be focused on technological or 
economic subjects, while the vocational track can have different core subjects, for instance hos- 
pitality training, cosmetics and mechanical workshop. Students are free to choose a high-school 
with no restriction on the track based on grades or ability and they tend to choose according to 
family background and child’s enjoyment of the curriculum (Giustinelli, 2016). Teachers give 
a non-binding track recommendation to families with an official letter sent to children’s home, 
which is also reported to the Ministry of Education. 

The choice of high-school is strongly correlated with the university choice: 80% of graduates 
in STEM universities in 2015 did a scientific academic or a technical track during high-school 
(62% did the scientific academic high-school track). Among students enrolled in vocational 
track, only 1.7% of the cohort graduating in 2016 enrolled in university, while the percentage 
increases to 73.7% and 32.3% in the academic and technical track respectively. Interestingly, 
among students of the technical track the majority enrolls in either STEM or economics degrees: 
62.5% vs. 52.4% of the academic track students. 


3 Data 


During September 2016, I invited 145 middle schools to take part in a research project regarding 
“The role of teachers in high-school track choice,” out of which 103 accepted and 91 provided 
all information necessary for my study.!° 

The sample was designed including all schools of the provinces of Milan, Brescia, Padua, 
Genoa and Turin with more than 20 immigrants in the school year 2011-12 enrolled in grade 6. 
The Appendix Tables A.I.! and A.I.[I show the balance tables of the characteristics of students 
used in the analysis and those of all Italian students in the same cohorts and of students in the 
same provinces, respectively. Although the standardized difference is always below 0.25 (?), 
the sample used in this paper has a higher share of immigrants compared to the national average 


(20.2% vs. 10.0%) and also compared to the average of the five provinces in the North (20.2% 





'0Tn 103 schools, I obtained the authorization of the principal to administer the survey to teachers, but only 
91 principals completed (without mistakes) the formal authorization to give me access to data from the National 
Institute for the Evaluation of the Italian Education System (INVALSJ). One school has only one class and missing 
data for the standardized test score. The final sample used in the analysis includes 90 schools. 


vs. 14.3%).'! Interestingly, the average score in standardized test in mathematics of both boys 
and girls are similar to the local and national average. 

I use four sources of data: teacher survey data, student survey data, administrative informa- 
tion from the Italian Ministry of Education (MIUR) and from the National Center for the Evalu- 
ation of the Italian Educational System (INVALSIJ). I collected directly detailed information on 
teachers, including implicit stereotypes measured by the Gender-Science Implicit Association 
Test (IAT), and on students’ self-assessment of own ability in different subjects. Administrative 
data from MIUR contains information on students’ gender, place of birth, high-school track 
choice, and their track recommendation. INVALSI provides information on standardized test 


scores and family background. 


3.1 Teachers: Gender Stereotypes and Other Characteristics 


From October 2016 to March 2017, I conducted a survey of around 1.400 math and literature 
teachers. The questionnaire was administered directly by enumerators using tablets in a meeting 
held in school buildings. Participants agreed to take part in the survey and signed an informed 
consent, in which it was explained that the survey was part of a research project aimed at 
analyzing the role of teachers in affecting students’ track choice.'!* There was no reference 
to gender bias. The time to complete the survey was around 30 minutes and teachers did not 
receive compensation for taking it. Among all math and literature teachers working in the 
schools involved in this research, around 80 percent completed our survey thanks to the strong 
support of principals.!* The survey is divided into two parts: the Implicit Association Test (IAT) 


and a questionnaire. 


Gender-Science Implicit Association Test 


I measure implicit gender stereotypes using a tool developed by social psychology and called 
Implicit Association Test (IAT) (Greenwald et al., 1998; Lane et al., 2007). The idea underlying 
the test is that the easier the mental task, the faster the response production and the fewer 


the errors made in the process.'* The IAT requires the categorization of words to the left or 





'l Around half the students are first generation and half are second generation. 

The data collection was conducted for a broad research project involving also an ongoing work in which we 
study teacher race bias (Alesina et al., 2018). 

'3Only 4 math teachers, started the questionnaire and then did not finish it since they claimed either that they 
were not expecting such a long survey or that they could not understand the purpose of the Implicit Association 
Test. I prepared a report for each principal of schools with an attendance rate of teachers to the survey higher 
than 70% with summary statistics on the outcomes of their students during high-school and they were strongly 
interested in it. 

'4This concept was initially developed by Donders (1868). Donders was very optimistic about the possibility 
of quantifying how mind works using the “time required for simple mental processes” and performed some of the 


7 


to the right of a computer or tablet screen and it provides a measurement of the strength of 
the association between two concepts — specifically in the Gender-Science IAT, gender and 
scientific/humanistic fields. Subjects were presented with two sets of stimuli. The first set 
included typical Italian names of females (e.g. Anna) and males (e.g. Luca), and the second 
set included subjects related to scientific (e.g., Calculus) and humanistic fields (e.g., Literature). 
One word at a time appears at the center of the screen and individuals are instructed to categorize 
them as fast as possible to the left or the right according with different labels displayed on the 
top of the screen (for instance, on the right the label “Female” and on the left the label “Male”’). 
To calculate the score, two types of tasks are used: in the first task, individuals are instructed 
to categorize to one side of the screen male names and scientific subjects and to the opposite 
side of the screen female names and humanistic subjects (“order compatible” task), while in the 
second task, individuals are instructed to categorize to one side of the screen female names and 
scientific subjects and to the opposite side of the screen male names and humanistic subjects 
(“order incompatible” task). The order of the two tasks is randomly selected at individual 
level. The idea behind the IAT is that if individuals have implicit associations between men 
and scientific fields, it should be easier and quicker to do the task when they categorize these 
words on the same side of the screen. The measure of implicit stereotypes is calculated as the 
difference in the reaction time in the task in which scientific fields and male names are in the 
same side of the screen compared to the task in which scientific fields and female names are in 
the same side of the screen. Detailed explanation of the IAT is provided in the Online Appendix 
cs 

A broad strand of literature in social psychology and an increasing number of papers in 
economics have provided evidence on the validity of IAT scores in predicting relevant choices 
and behaviors (Nosek et al., 2007; Greenwald et al., 2009). For example, Reuben et al. (2014) 
shows in a lab experiment that higher stereotypes (measured by the Gender-Science IAT) predict 
employers’ biased expectations against female math performance and also suboptimal update 
of expectations after ability is revealed. Higher implicit gender bias is acquired at the beginning 
of elementary school and is generally associated with lower performance of females in math 
during college, lower desire to pursue STEM-based careers and lower association of math with 
self, even for women who had selected math-intensive majors (Cvencek et al., 2011; Nosek 
et al., 2002; Kiefer and Sekaquaptewa, 2007). In the context of race implicit bias, studies have 
shown the relevance of IAT scores in affecting job performance of minorities (Glover et al., 





first experiments making participants respond with the right hand to stimuli on the right side and with the left hand 
to stimuli on the left side. 

'The order of the tasks was randomized at individual level and in the Online Table A.II I provide evidence that 
the impact of the order of the blocks is small in magnitude. However, in all regressions, I control for ordering 
factors, but they do not have a statistically or economically significant effect on the estimates. 


2017) and call-back rates of job applicants (Rooth, 2010). 

There is a lively debate among social psychologists on Implicit Association Tests. First, 
some papers have argued that IAT has weak predictive validity (Blanton et al., 2009; Oswald 
et al., 2013). Most of the studies refer to experiment with less than 50 subjects and they don’t 
have information outside the lab on whether individuals with stronger implicit associations 
are actually biased in their interaction with stigmatized groups. Hence, I believe that further 
research is necessary in order to fully address its predictive validity. My paper can contribute 
to this debate. Second, some studies suggest that IAT can be faked after respondents acquired 
knowledge of the test (Fiedler and Bluemke, 2005). IAT is not widespread in Italy and none 
of the teachers who took the survey reported to be familiar with the test. Without any hints, it 
seems unlikely that they were able to figure out how to trick the test. However, if they were, this 
would increase the noise in the [AT score and lead to an attenuation bias when estimating the 
impact of implicit stereotypes on student outcomes. Third, [AT scores could be contaminated 
by extrapersonal associations that are are available in memory, but that do not contribute to 
an individual’s personal evaluation when one interacts with the specific category (Olson and 
Fazio, 2004). I design a IAT test specifically related to the schooling context by associating 
subjects and gender and teachers do the survey within the school building. The concern of 
capturing associations outside the schooling context is alleviated in this context. Forth, at least 
part of IAT scores are capturing unstable characteristics that do vary over time. For instance, 
race implicit associations have been shown to decrease after subjects viewed pictures of admired 
African Americans and disliked White Americans (Dasgupta and Greenwald, 2001). This short- 
term exposure may introduce additional noise in the measurement. Fifth, there is a debate 
on whether IAT scores should have normative ground and should be used be used in policy 
design (Tetlock and Mitchell, 2009). At this stage of development and considering the noise 
in the score, I would refrain from using IAT scores to make decision about others. They are 
educational tools that may be useful to correct biased behavior (Alesina et al., 2018). Finally, 
some papers in social psychology argue that “the data may reflect shared cultural stereotypes 
rather than personal animus” (Arkes and Tetlock, 2004). This is coherent with my finding of 
a significant correlation between Gender-Science IAT scores and gender norms in the place of 
birth of individuals. 

To sum up, IAT scores are noisy measure of implicit stereotypes that may be affected by cul- 
ture and socialization experience. Nevertheless, they have the great advantage of avoiding social 
desirability bias in the response and capturing implicit association potentially unconscious to 
the individual that may affect his or her interaction with the stigmatized group. In this study, 
I am not interested in whether teacher have stereotypes (i.e. in the level of IAT score), but on 


whether those with higher stereotypes have a negative impact on performance, track choice, and 


self-confidence of girls and boys. 


Teachers’ Questionnaire 


After the Implicit Association Tests, enumerators invited teachers to complete a questionnaire 
asking detailed information about family background of teachers (age, parents’ education, place 
of birth, age and sex of children, etc) and career related aspects (type of contract, years of 
experience, whether they are involved in the management of the school or in the organization 
of Math Olympics Games, etc). Furthermore, they were also asked questions about explicit 
bias, as for instance beliefs about gender differences in innate math ability and the standard 
Word Value Survey question: “When jobs are scarce, men should have more right to a job than 
women”.'© Participants are in general reluctant to explicitly endorse gender stereotypes about 
differences in innate ability and employment (Nosek et al., 2002), potentially leading to social 
desirability bias in the responses. These aspects are emphasized by the awareness of being 
interviewed as teachers. Enumerators collected the allocation of teachers to classes from the 
school year 2011-12 to the school year 2016-17, in order to merge teacher and student data. I 


double check all this information using data provided directly by schools and in their websites. 


Descriptive Statistics on Math Teachers 


The dataset includes 537 math teachers and , but I restrict the main analysis to 303 teachers 
(“matched sample”) who were working for the same school even before 2016 and for which I 
have student data.!’ Online Appendix Table A.III shows the balance table of the differences 
between the sample of teachers matched (303 teachers) and the other 234 math teachers who 
completed the IAT. As expected, teachers not matched are around 9 years younger, 40 percent 
less likely to have full-time contract and they have 12 years less of experience in teaching. 
However, not only the average, but also the entire distribution of implicit gender bias of the 
matched and not-matched teachers is extremely close (exact p-value of Kolmogorov-Smirnov: 
0.971, Online Appendix Figure A.1). 

Table I reports descriptive statistics on math teachers. Most teachers are females (84%), they 
are on average 52 years old with 23 years of experience in teaching and 92% hold a full-time 
contract. The majority (66%) of math teachers are born in a city in the North of Italy, but a 
substantial share is born in the Center or South of Italy and then migrated to the North to work. 


Most teachers graduated from programs in biology, natural sciences and other related subjects: 





'6The specific questions are reported in the Online Appendix C.2. 

'"T have information also on 853 Italian teachers, but the main focus of this paper is on math teachers given 
the strong belief that men are better than women in mathematics. Part of the results exploiting data on literature 
teachers are available in the Online Appendix. Further information available upon request to the author. 
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only 24% studied math, physics and engineering. At the bottom of Table I, I report the summary 
statistics of explicit bias questions described in details in the Online Appendix C. There is little 
variability in the self-reported bias questions, potentially also due to social desirability bias and 
the widespread explicit rejection of stereotypes (especially among teachers). For instance, less 
than 2% of the interviewed teachers respond that they agree with the statement that women have 
less right to jobs than men when opportunities are low. 

Based on the positive average IAT scores, math teachers tend to to (slightly) associate math 
with male names. Considering the thresholds typically used in the social psychological liter- 
ature, 24% of teachers slightly or moderately associates math with girls, 31% presents little 
to no clear associations, 19% shows slight male-math association and 26% show moderate to 
severe male-math associations.!® For comparison, the sample of 1164 Italians used by Nosek 
et al. (2009) have an average Gender-Science IAT score of 0.40 (SD 0.40): the score of math 
teachers is on average lower than this sample (mean 0.09, SD 0.37, as shown in Table I), while 
Italian teachers are very close to it (mean 0.38, SD 0.39).!? Interestingly, the great majority 
of math teachers are women and this may have important implications for the association of 
scientific subjects with gender. For ease of interpretation of my results, I standardize the [AT 


score to have mean zero and variance one in the main results of the paper. 


3.2 Students: Administrative Data and Self-Confidence 


I obtained individual level information from the Italian Ministry of Education and from the 
National Institute for the Evaluation of the Italian Education System (INVALSID) for three co- 
hort of students enrolled in grade 6 between school year 2010-11 and 2012-13. The data 
available include math and reading standardized test score in grade 6 and 8, parents’ education 
and occupation, baseline individual information (date and place of birth, gender, citizenship), 
high-school track choice and official teachers’ recommendation. Students in grade 8 in 2014 of 
24 schools in this sample are asked to complete a survey about their track choice, around two 
months before the end of middle schools. In particular, they need to report their belief about 
their own ability in each subject, choosing between “good”, “mediocre”, “scarce”. 

Table II reports summary statistics on students’ information. I restrict the sample to students 
with information available on the standardized test score in grade 6 and 8 and for whom I have 


the implicit association test of their math teacher in grade 8. This is the sample that is used 





'8Greenwald et al. (2003) suggests that a raw IAT score below -0.15 show bias in favour of the stigmatized 
group, between -0.15 and 0.15 little to no bias, from 0.15 to 0.35 slight bias against the stigmatized group and a 
value higher than 0.35 as moderate to severe bias against the stigmatized group. 

!°Tn the paper by Nosek et al. (2009), individuals completed the IAT online in the Implicit Project website. 

0Individual level data are anonymous and I obtained the authorization from each school principal to access data 
from their school. 
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in the empirical analysis of this paper.*! In the sample, 50% of students are males and boys 
and girls are balanced in terms of baseline characteristics related to place of birth, generation 
of immigration, parents’ education and occupation. Test scores are standardized to have mean 
zero and standard deviation one per subject and year in which the test was taken. Females at the 
beginning of middle school are lagging behind of 0.19 standard deviations in math and ahead 
of 0.13 standard deviations in reading, with respect to males. In the same table, I also report the 
raw gender differences in outcomes. 

The high-school track choice in this sample is comparable to the average national choices in 
those years: females are almost 10 percentage points less likely to choose an academic scientific 
track and almost 25 percentage points less likely to enrol in a technical technological track. 
Girls are more likely to choose an academic track than boys, but not a top-tier ones, which 
include classical and scientific tracks. Indeed, one third of females choose a social, linguistic or 
artistic academic tracks. Vocational school is chosen at an equal rate by both genders. However, 
teachers recommend 36% of males toward vocational track and 30% of females, while the 
scientific track is recommended only to 17% of males and 11% of females.’” Finally, from 
the original information available for a sample of students, I observe that on average there are 
no gender differences in assessment of ability, but females are 9 percentage points less likely 
than boys to consider themselves good at math and boys are 5 percentage points less likely to 
consider themselves good at Italian. 


4 Empirical Strategy 


4.1 Estimating Equation 


The main purpose of this paper is to investigate the impact of teachers’ gender stereotypes on 
student achievement. I exploit two identification strategies. The first is aimed at investigating 


the gender gap within a class, estimating the following equation: 


Vic = A + (Female; x stereotypes.) + O)Female; + e+ (1) 
+X;p1 + (Female; x X;) P2 + (Female; x Ze) P4 + Ejic 


where yj;- is the outcome (i.e. math standardized test score, track choice, and self-confidence) of 
student i in class c. Female; is a dummy variable which assumes value | if the student i is a girl 


and stereotypes, is the standardized value of the IAT score of the math teacher assigned to class 





?1The Online Appendix D describes in details the sample selection and potential attrition issues. 
>2Tn some schools, more than one recommendation is given to students. Here, I report summary statistics only 
for the first recommendation. 
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c in grade 8.2° Linclude fixed effects at class level Nc, which absorb the average effect of teacher 
bias in class c. Furthermore, I include student characteristics X; (initial standardized test score, 
parental education and occupation, immigration status and generation of immigration), and 
teacher characteristics Z, (as gender, place of birth, age, teacher “quality”*, type of contract, 
type of degree achieved and self-reported gender bias) interacted with the gender of student i. 
Standard errors are robust and clustered at teacher level. 

Crucially, in this identification strategy, class, teacher, and school level characteristics are ab- 
sorbed by class fixed effects. Indeed, as described in Section 2, students are assigned to a class 
in grade 6 and attend all lectures with the same classmates until grade 8. We can only identify 
the impact of teacher IAT score on the gender gap in the dependent variable, i.e. the interaction 
between the gender of students and implicit stereotypes of teachers. The coefficient of interest, 
Q;, measures how the gender gap in the class is affected by the assignment to teachers with 
one standard deviation higher stereotypes.*> I expect the estimate of 0 to be attenuated for 
the measurement error in the gender IAT score. Indeed, occasion-specific noise may introduce 
an attenuation bias, as suggested by Glover et al. (2017).*° For robustness, I include controls 
for student characteristics X; interacted with the gender of the pupil. The regression also con- 
trols for the gender of students interacted with teacher characteristics Z,. This is potentially 
important to partial out differential impact on boys and girls of gender, background, and other 
observable characteristics of teachers. Furthermore, this allows to establish whether the impact 
of teachers’ stereotypes on gender gap among classmates can be explained (or attenuated) by 
teachers’ observables. 

The second identification strategy relies on the comparison of students of the same gender 
enrolled in the same school, but assigned to teachers with different stereotypes. I investigate 
whether the impact of teacher IAT score on gender gap is due to higher performance of boys, 
lower performance of girls, or a combination of the two effects. I estimate the following equa- 


tion: 


Yiesy = Bo + Bi (Female; x stereotypes.) + BoFemale; + B3stereotypese + Nsy+ 


(2) 
+Xjp1 + (Female; x X;)p2 + Z-p3 + (Female; x Z.) Pa + Eicsy 





3On average in 70% of the cases professors have been teaching to the same class from grade 6 to grade 8, in 
11% of the cases from grade 7 and in 19% only for grade 8. Two different classes can be assigned to the same 
teacher. 

4Teacher “quality” is proxied by being the teacher in charge of math Olympics in the school, refresher courses 
and years of experience. Online Appendix Table A.V shows that being the teacher in charge of math Olympics in 
the school is correlated with the value added, especially for females. 

°5T discuss the exogeneity of student assignment to teachers in Section 4.3. 

6Glover et al. (2017), while analyzing the impact on manager implicit bias on minority workers, suggest that 
we may expect an attenuation bias of approximately a factor of 1.8 due to measurement error in the IAT score. 
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where 1)sy are school s by cohort y fixed effects and standard errors are robust and clustered at 
teacher level. All other variables are defined as in equation (1). 

Institution level characteristics are captured by school by cohort fixed effects. The advantage 
with respect to specification (1) is that we can analyze the impact of teachers’ stereotypes sep- 
arately on male students (83) and on female students (8; + £3). The drawback is that I cannot 
control for unobservable characteristics at the teacher or class level: this specification exploits 
variation in the level of teachers’ stereotypes to which students of the same gender in the same 


school and cohort are exposed. 


4.2 Correlation between implicit bias and individual characteristics 


IAT scores are correlated with observable characteristics of teachers. Figure I plots the entire 
distribution of implicit bias for math and literature teachers by gender: interestingly, individ- 
uals teaching a subject which is stereotypically associated with their own gender (i.e. men 
teaching math and women teaching Italian) have stronger implicit association male-math and 
female-literature. This result suggests that individuals possess implicit gender stereotypes in 
self-favourable form because of the tendency to associate self with desirable traits — in this 
case, own gender with the subject they teach (Rudman et al., 2001) . 

The richness of the data collected allows me to dig deeper into the determinants captured 
by reaction time to stimuli in the IAT score. Panel A of Table III shows that women teach- 
ing math have lower implicit stereotypes (column 1), but age, education of own mother, and 
whether teachers have children does not have a statistically significant correlation with [AT 
scores (columns 2-5). Gender stereotypical beliefs are rooted in cultural traits, transmitted from 
generation to generation (Guiso et al., 2006). Indeed, I find that exposure to cultural norms 
is strongly associated with the IAT score (Panel B of Table IJ). Column 1 shows that implicit 
stereotypes are correlated with the place of birth of teachers: around 35 percent of math teachers 
in this sample are born in the South where gender norms are stronger, as shown for instance by 
Campa et al. (2010).’ I further investigate this aspect by providing evidence that women labor 
force participation in the province of origin of teachers is negatively correlated with the IAT 
score (Panel B, column 2). Furthermore, I use, as proxy of cultural norms in the province of 
birth, the answers to the World Value Survey question on the relative rights of men and women 
to paid jobs when the latter are scarce.** I find a positive correlation between less conservative 





27Italy is a country with low labor market participation of women, but substantial geographic variation across 
regions. In 2016, only 31 percent of women in the South of Italy were employed, while in the North around 58 
percent were working, similarly to the average of OECD. 

*8Thanks to the data used in Campa et al. (2010), I have access to the answers at province level of the following 
World Value Survey question: ’When jobs are scarce, men have more right to a job than women”. 
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gender norms measured by this question and IAT scores (Panel B, column 3). During the survey 
I administered, I asked the same question to teachers themselves and I find a low and indistin- 
guishable from zero correlation (Panel B, column 4). There may be social desirability bias in 
the self-reported measure when teachers are interviewed in the school. In column 5 of Panel 
B, I correlate implicit bias and explicit beliefs about innate differences in ability between men 
and women and I find a weak positive correlation indistinguishable from zero. This result is not 
surprising in light of social psychology literature, where implicit often differ from explicit and 
self-reported stereotypes (Lane et al., 2007; Nosek et al., 2002). 

In Panel C, columns | and 2, I correlate the IAT score with qualifications of the teacher (type 
of degree and whether the degree was achieved with honour). I find negative point estimates and 
high standard errors. Another rough proxy of quality of teachers is tenure (which is associated 
with higher experience in teaching), and being the professor in charge of math Olympiads in the 
school.?? Also in these cases, point estimates are small and indistinguishable from zero. I also 
check whether the Gender-Science IAT score is correlated with the race IAT score. In the same 
regression as in Table III, I find that the correlation is -0.068 (standard error 0.123). Hence, 
math teachers more biased in one sphere are not more biased also in the other sphere. The [AT 
score does not seem to capture a general “ability” in doing this type of test for math teachers 
(for further information, see Alesina et al., 2018). The Online Appendix Table A.IV shows 
jointly all correlation presented in separate regressions in Table III. Interestingly, the results are 
substantially invariant: gender and place of birth of teachers are the two most relevant aspects 


in affecting IAT scores in all specifications. 


4.3 Exogeneity Assumption 


Next, I present evidence on the absence of a systematic correlation between gender stereotypes 
of teachers and student characteristics, such as socio-economic background and ability. If par- 
ents are able to guess who is the teacher with more stereotyping behaviour, they may try to 
(informally) affect class assignment of their daughters. Although this seems unlikely because 
implicit stereotypes are not easily observable, it is also possible that they try to select teachers 


according to characteristics correlated with IAT score, such as gender and place of birth.°? In 





°In each school, usually only one professor is in charge of math Olympiad and anecdotally she is a highly 
motivated and passionate teacher. Indeed, as shown in the Online Appendix Table A.V, teachers in charge of math 
Olympics induce higher improvements in test scores of their students. 

30Tn Italy, parents dislike being assigned to a teacher with a temporary contract that may have little experience 
and may change during the years of middle school. This paper focuses on variation of exposure to a sample of 
teachers that has been teaching in the same school since at least 2014. They have a lot of experience (on average 23 
years) and almost all have a full-time contract. Furthermore, even if some parents manage to allocate their children 
to teacher with higher ’quality”, it does not necessarily mean that they are less gender biased. For instance, the 


1S 


column | of Table IV, I provide evidence that daughters of highly educated mothers are not less 
likely to be assigned to teachers with more stereotypes than those from lower socio-economic 
background — the difference is not statistically significant and the point estimate goes in the op- 
posite direction. In columns 3, 4, 5 and 6 of Table IV, I analyze the correlation respectively with 
father occupation, immigration background and with a the proxy of ability using standardized 
test scores in reading in grade 6 and I do not find statistically significant correlation. The point 
estimates are also small in terms of magnitude. Finally, in the last column, I also include the 
standardized test score in math in grade 5, before entering middle school, although the sample 
size is substantially reduced for data availability issues.°! The assumption of “as good as ran- 
dom” assignment of students to teachers with different IAT score, within a school, seems to be 
supported in this context. The results are identical when observations are collapsed at teacher 
level, as shown in the Online Appendix Table A.VI. I also check that teachers with higher bias 
are not systematically associated with fewer females in the top or bottom of the distribution. I 
find that this is not the case, as shown by the results considering the share of female students 
in the top 20, top 50 and bottom 20 percent of the distribution in the standardized test score 
in grade 6 presented in the Online Appendix Table A. VII. Finally, principals need to assign all 
math teachers to a class, even if he or she can guess who is the teacher with stronger stereotypes, 
because they don’t have excess of teachers’ paid hours. 

The second aspect regards the absence of systematic grouping of students by socio-economic 
background and initial ability. Within schools, classes are formed by the principal with the main 
objective of creating comparable groups in terms of gender, ability, and socio-economic back- 
ground across classes and therefore to guarantee heterogeneity within each class in the same 
school and cohort. This objective is spelled out in the official documents on the school web- 
sites and also emerges from self-reported information from principals discussed in the Online 
Appendix B. I have information about the observable characteristics of students that are used to 
create classes (gender, education and occupation of parents, immigration status, and generation 
of immigration). Plausibly, unobservable student characteristics are also unknown to school 
principals at the moment of class formation, also considering that students change all teachers 
and school building from elementary to middle school. I check whether class assignments are 
statistically independent with a series of Pearson Chi-Square tests (Lavy and Sand, 2015). First, 
I consider the assignment of individual level characteristics (gender, education and occupation 


of parents, immigration status, and generation of immigration). Then, I also check that within 





teacher in charge of math olympics in the school is usually one of the best math teacher (as shown also in the 
Online Appendix Table A.V). However, if anything, they are slightly more gender biased than others (Table III). 

3!Unfortunately, for reasons related to confidentiality, I have obtained the standardized test scores in grade 5 
only for those students that did not change school code between elementary and middle school. There are only few 
students for which I have this information. 
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each characteristic, class assignment is statistically independent from gender. I find that in less 
than 7.8% of the tests performed, the p-value is lower or equal than 5%°*. This implies that 
for only 7.8% of the classes we cannot reject that there is non-random assignment of one back- 
ground characteristic. There is no evidence of systematic grouping of students according with 


their socio-economic background. 


4.4 Reverse Causality 


Teachers’ gender stereotypes were collected between October 2016 and March 2017 and are 
matched with students who graduated from middle school between June 2013 and June 2015, as 
clarified in the timeline in Figure II. Hence, teacher IAT tests are administered to teachers after 
students in the sample graduated from middle school. As for Glover et al. (2017), a potential 
concern is that IAT scores are affected by exposure to sigmatized groups. Indeed, the IAT is 
expected to be the combination of a trait stable over time capturing individual stereotypes and 
an occasion-specific variation and noise that may be affected by conditions while taking the test 
and stimuli received by the subject in the period right before the test.*? 

Reverse causality seems unlikely for several reasons. First, as shown in Section 4.3, teachers 
with more stereotypes are not systematically assigned to a differential treatment in terms of 
student characteristics, such as family background, standardized test scores in math and read- 
ing (see Table IV, Online Appendix Table A.V! and A. VII). Second, under the assumption of 
monotonic decay of the influence of exposure to students, I would expect a higher effect for the 
most recent cohort. However, results are stable in all three cohorts, as shown in the robustness 
analysis (Online Appendix Table A. VIII). Third, math teachers included in our analysis have 
been teaching on average for 23 years (with a median of 25 years) and therefore over time they 
were exposed to hundreds of females and males students. Furthermore, for data availability 
issues, we do not include in the sample the cohort of student graduating right before the school 
year in which the IAT test was administered. Each teacher has been exposed on average to 
4 classes (around 100 students) after those students included in our analysis graduated from 
middle school.** 

In fact, there is a main advantage from exploiting this timeline: taking the IAT or knowl- 
edge about this study could not have affected students’ performance nor teachers’ or parents’ 


attention to the issue of gender stereotypes for the cohorts of boys and girls in this dataset. 





32Given the size of the Table, it is not reported in the paper but it is available upon request to the author. 

33The test-retest reliability of IAT is generally considered as satisfactory by social psychology, with a correlation 
of 0.56 that does not change with the length of time between testing (despite being usually of less than one month 
in most studies) (Nosek et al., 2007). 

34S tudents who were enrolled in middle school in the school year 2015-2016 and 2016-2017 are not included in 
the sample because they did not take one of the standardized test score. 
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5 The Impact of Teachers’ Implicit Bias 


5.1 Performance in math 


By the age of 14, girls are lagging behind in math compared to their male classmates by around 
0.22 standard deviations, a result comparable to several other countries (Fryer Jr and Levitt, 
2010; Bharadwaj et al., 2016).*> As children complete more years of education, the differences 
between boys and girls gets bigger. The additional gender gap in math generated during the last 
two years of middle school is around 0.08 standard deviations, as shown in column 2 of Table 
V. This paper analyzes what happens to the gender gap when students are assigned to teachers 
with stronger gender stereotypes. 

Table V shows the effect of teachers’ implicit stereotypes on gender gap in math performance 
within the class, presenting the results of estimating equation (1). Classes that are assigned to 
teachers with one standard deviation higher IAT score have 0.03 standard deviations higher 
gender gap in math performance. It corresponds to an increase of 38 percent of the gender 
difference in performance generated during middle school, considering an average gap of 0.08 
standard deviations. Column 4 includes student characteristics X; and their interaction with 
gender of the children. Adding these controls does not change the coefficient of interest. All 
results exploit information on three cohorts of students. in the Online Appendix Table A. VIII, I 
show the effect of the main specification presented in Table V for the three different cohorts of 
students separately. Reassuringly also for the potential reverse causality concerns expressed in 
Section 4.4, results are not statistically different in the three cohorts.°° 

Although the level of teachers’ stereotypes and all characteristics are absorbed by class fixed 
effect, as clarified describing equation (1), column 5 includes the interaction between student 
gender and teacher characteristics Z;,. If anything, the coefficient of interest “Fem*Teachers’ 
Stereotypes” slightly increases in magnitude when all these interaction effects are absorbed. 
Observable characteristics of teachers, interacted with students’ gender, are not driving the re- 
lation between gender gap and teachers’ stereotypes. I report the coefficients only for the main 
characteristics of teachers interacted with students’ gender, but the effects are mainly small and 


insignificant for all variables, including age, parents’ education, whether he or she has daugh- 





35in the Online Appendix Figure A.II, I show the average gap in PISA test scores across countries. According 
to a meta-analysis performed on 100 studies in several countries, gender gaps in mathematics are around 0.29 
standard deviations in high-school (Hyde et al. (1990), two years after the end of middle school. The average 
gender gap without controlling for class fixed effects is substantially invariant (0.20 standard deviations as shown 
in Table II). Most of the variation in math performance is within classes, coherently with the aim in class formation 
of creating groups heterogeneous within class and homogeneous across classes. 

36For the first cohort, I have fewer observations because some schools change the code identifying the school 
that year for administrative reasons and I am not allowed to access data identified with the older codes. 
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ters, whether he or she achieved the degree with /aude, the type of teaching contract, refresher 
courses and appointment as teacher in charge of math Olympics. The latter controls are crude 
proxies for teacher quality in terms of improvements in standardized test scores, as shown ana- 
lyzing the relation between value added and teachers observables in the Online Appendix Table 
A.V. Teachers born in the North of the country do not have a differential effect on boys and 
girls. 

Finally, as it can be seen in column 5 of Table V, ceteris paribus, female students assigned 
to female teachers have slightly lower, albeit insignificantly so, math performance in test scores 
in grade 8 compared to their classmates*’. The impact indistinguishable from zero of teacher 
gender is coherent with the result of Bharadwaj et al. (2016). However, other studies find 
that having a teacher of own gender helps improve performance, especially at college level 
(Dee, 2005; Carrell et al., 2010). In the Online Appendix Table A.IX, I split the sample among 
student of male and female teachers. Although the effect is not statistically significant for pupils 
assigned to male teachers due to a small sample size, the point estimates show that the impact 
of teachers’ implicit stereotypes on student performance is similar in terms of magnitude for 
male and female teachers. What seems to matter is whether the teacher has gender stereotypes 
and not the gender of the teacher per se — even if as I discussed above men tend to have higher 
IAT scores than women in the sample of math teachers. 

To give a clearer interpretation, Figure III reports the same estimates of Table V using a cat- 
egorical variable instead of the continuous one. I consider the thresholds defined by Greenwald 
et al. (2003), where “no stereotypes” is the interval of IAT raw score between -0.15 and +0.15, 
while “boys-math” and “girls-math” indicate a stronger association of the scientific field with 
male and female names, respectively. Being assigned to a teacher with a “boys-math” attitude 
(45% of teachers) compared to a teacher with a “girls-math” attitude (25% of teachers) leads to 
triple the gender gap in math improvements within the class (from -0.035 standard deviations 
to -0.10 standard deviations). The same results are reported in columns 1-3 of Online Appendix 
Table A.X, while in columns 4-6 I consider a dummy for whether IAT score of the teachers is 
positive or negative, finding a similar pattern. 

Are biased teachers worse instructors or are they helping boys to learn math? I next inves- 
tigate the effect of teacher bias from estimating directly equation (2), comparing students of 
the same gender within the same school and cohort, but assigned to different classes. Figure 
IV shows that having a teacher with strong gender stereotypes has a negative impact on female 
students, while a bias in favour of girls has a positive impact in their math improvements. The 


linear approximation presented in this paper seems to adequately represent the data. There is no 





37It should be noticed, however, that most of teachers in Italian middle schools are females, also in math. There 
is little variation on the gender of teachers. 
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statistically significant impact on male students, throughout the whole distribution of teachers’ 
IAT score. Column 3 of Table VI mirrors Figure IV: it presents the results of the regression 
analysis and shows that girls are lagging behind when assigned to more bias teacher, while boys 
are not affected by teachers’ stereotypes. The results are robust to the inclusion of the same 
controls as in Table V. In this specification the characteristics of teachers are not absorbed by 
class fixed effects and therefore controls at teacher level, included in columns 5, are particularly 
relevant. 

The differential response by gender is consistent with the previous results in the economic 
literature: women are more responsive to negative feedbacks then men in stereotypically male 
fields, such as math (Kugler et al., 2017). Coffman (2014) finds that individuals are signifi- 
cantly less likely to contribute with their ideas in gender incongruent fields and this is partic- 
ularly strong for women, leading to more missed opportunities among female in male-typed 
categories than for males in female-typed categories. Furthermore, the type of task affects gen- 
der differences in the willingness to complete, with wider gaps in stereotypically male tasks 
(Niederle and Vesterlund, 2010; GroBe and Riener, 2010). 


Heterogeneous effects 


I now examine which students are the most affected by teachers’ stereotypes, considering their 
background characteristics and the time of exposure to their teachers. Table VII shows that the 
effect of implicit stereotypes is stronger for the most disadvantaged groups of female students. 
Based on the estimates in column 2, a standard deviation increase in teacher bias leads to 0.057 
standard deviations higher gender gap among students with low educated mothers and of 0.029 
standard deviations among students with mothers who completed at least high-school, although 
the difference is indistinguishable from zero at conventional levels. In the following column, I 
analyze the impact of teacher bias in the three terciles of the distribution of the standardized test 
score in grade 6. The effect is stronger for students in the lowest tercile (-0.078, with standard 
error 0.029) and turning positive, but not statistically significant, only for students in the top 
of the initial ability distribution in grade 6. Finally, the effect if anything is slightly stronger 
among immigrants, even if the difference with natives is indistinguishable from zero. 

Why do girls from disadvantaged backgrounds suffer the most from the interaction with 
biased teachers? The empirical evidence presented is coherent with the stereotype threat model 
(Steele and Aronson, 1995): individuals with higher risk of conforming to the predicament 
that “women are bad at math” are those more deeply affected. Indeed, male students are not 
influenced by teachers’ stereotypes and among females those strongly affected have lower initial 
math achievements and therefore are at higher risk of confirming the negative expectations 
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on their group. The Online Appendix E presents a conceptual framework that illustrates how 
teachers’ stereotypes can differentially affect effort and outcomes of students in the bottom 


and the top of the ability distribution.*® 


One complementary explanation, coherent with the 
interaction theory (McConnell and Leibold, 2001), is that female students with highly educated 
mothers or with higher initial level of math achievement may need less interaction with their 
math teacher in order to avoid lagging behind with their peers. They are more likely to have both 
additional support to believe in their own abilities and alternative role models. Interestingly, this 
result is also coherent with the evidence from Tiedemann (2002): teacher perception of math 
ability of their students is biased mainly toward average and low achieving female students who 
are perceived as less talented compared to their actual performance. 

In order to investigate further this aspect, I analyze the differential effect according to the 
“quantity” of interaction time between teacher and students. The last two columns of Table VII 
analyze whether there are heterogeneous effects in terms of years of exposure and hours per 
week. Indeed, around 75% of students interact with the math teacher for six hours per week, 
while the rest for 9 hours per week. Furthermore, I exploit the fact that around 20% did not 
have the same teacher for all three years of middle school. However, for both variables, I do 
not see a statistically or economically significant pattern. Most likely the impact of teacher 
gender stereotypes begins at lower intensive margins and I do not have proxies of the “quality” 
of teacher- student interaction that would be necessary to further investigate this mechanism. 


5.2 Choice of High-School Track and Teachers Recommendation 


High-school track choice is the first crucial career decision in the Italian schooling system. 
Students and their families are free to choose their most-preferred track, with no constraints 
based on grades or teachers’ official track recommendation. There are three main types of high- 
school: academic, technical, and vocational. As shown in Table II, there are substantial gender 
differences in the type of track selected: the preferred choice among females are academic track 
related to psychology, languages and art, while for males the preferred choices are academic 
scientific and technical technological tracks. Students in different tracks have in most cases 
little to no interaction during the school day since buildings are separated. Finally, the choice 
of high-school is strongly correlated with university choice, as discussed in Section 2. The 
scientific academic path easily opens up career opportunities in STEM related fields, while the 
vocational choice is highly correlated with almost no tertiary education. Hence, I explore the 
impact of teachers’ steretypes on the track choice at the end of middle school, with a focus on 


the choice of the scientific academic track and on the vocational track. 





38This conceptual framework is an extension of the stereotype threat model presented by Dee (2014). 
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Table VIII, Panel A, shows that girls are 9.5 percentage points less likely than boys to attend 
a scientific track and equally likely to attend a vocational track. Teachers also recommend less 
girls toward a scientific track (Panel B, column 1). The gap is reduced but still substantial after 
controlling for the standardized test score in math. I find a close to zero and insignificant effect 
of teachers’ stereotypes on the gender gap in scientific track choice (Panel A, columns 2-4) and 
in the recommendation of teachers toward this path (Panel B, columns 2-4). The inclusion of 
controls at student and teacher level interacted with the gender of pupils do not affect the point 
estimates of interest. 

Recent work suggests that women are more responsive to negative feedback than men in 
STEM fields (Kugler et al., 2017). However, the scientific track is chosen by females with 
highly educated parents or with high achievement test scores, whose performance was not af- 
fected by teacher bias, as shown analyzing the heterogeneous effects in Section 5.1.°” These 
female students are likely to have other academic-oriented role models in addition to their math 
teacher and a lower vulnerability to the gender stereotypes. 

Teachers’ stereotypes have strong impact at the bottom of the ability distribution. Indeed, as 
reported in columns 6 of Panel A, girls, when assigned to a teacher with one standard deviation 
higher implicit stereotypes, are more likely than their male classmates to attend vocational track 
by around 2 percentage points. This effect corresponds to an increase of 13% with respect to 
the mean probability of attending vocational training for girls. This result mirrors an analogous 
differential in teachers’ track recommendation toward vocational school as shown by Panel B, 
columns 6. The subsequent two columns include characteristics of teachers and pupils and their 
interaction with the gender of the latter. Adding these controls does not change the coefficient 
of interest. When exposed to less gender-biased environment, female students are more likely 
to attend the technical track (see Online Appendix Table A.X1). 

Figure V reports the same estimates using a categorical variable instead of the continuous one 
to offer a clearer representation of the results.*9 Girls assigned to a teacher with a “boys-math” 
attitude have a probability of 18.5% of attending the vocational track, while female students 
assigned to a teacher with a “girls-math” attitude have a 6.1 percentage points lower probability 
of attending the same track, which corresponds to a decrease by 33%. 

The Online Appendix Table A.XII shows the results estimating equation (2), with school 
instead of class fixed effects. They confirm the previous evidence of a substantial impact on 
female students in terms of choice of vocational training, while boys are slightly less likely 





3°In the questionnaire administered to teachers, I ask them why girls, compared to boys with the same math 
performance, are less likely to attend the scientific track: the reason identified as the most important by teachers is 
the parental influence (for the summary statistics see Table I). 

40T use the thresholds defined by Greenwald et al. (2003) and exploited also in Figure III. For more details, check 
Section 5.1. 
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to be recommended by their teachers toward vocational track and also less likely to choose it. 
Finally, Online Appendix Table A.XIII presents results from the heterogeneity analysis and, 
as expected, the impact of teacher bias has a stronger effect on the track choice of female 
students from disadvantaged background. The enrolment of females from the bottom tercile of 
the distribution increases by 5.1 percentage points for one standard deviation higher bias of the 
math teacher (which corresponds to a 16.4% increase with respect to the mean value for this 


group). 


5.3. Additional Outcomes 
Reading Performance and Stereotypes of Literature Teachers 


Table A. XIV in the Online Appendix exploits the information on literature teachers that I have 
collected through the same IAT test and questionnaire. The gender gap in reading is reversed 
compared to the one in math, similarly to most OECD countries (Fryer Jr and Levitt, 2010). 
Female students are 0.196 standard deviation better in reading compared to male classmates at 
the end of the middle school and the gap is increasing of 0.082 standard deviation from grade 
6 to 8, as shown in the first two columns of the Online Appendix Table A. XIV. However, the 
gender stereotypes of literature teachers are not affecting this gap, as shown in the subsequent 
columns of the same Table. The effect of interest is close to and indistinguishable from zero 
and it is not affected by the inclusion of teacher and student controls. 

The Online Appendix Table A.X'V investigates the impact of teacher stereotypes, consider- 
ing the implicit IAT of both literature and math teachers and therefore restricting the sample 
to those classes for which these information are jointly available. The implicit stereotypes of 
literature teachers does not have a significant impact neither on math nor on reading standard- 
ized test scores (columns 1-4). Hence, the inclusion of their IAT score does not affect the 
negative and statistically significant effect of math teacher’ stereotypes on math performance. 
Interestingly, being assigned to a math teacher with stronger implicit stereotypes seems to have 
negative, although indistinguishable from zero, effect on performance in reading, suggesting 
that female students do not simply substitute their effort in math with more effort devoted to 


studying literature. 


Bias in Grading 


Previous literature has shown the importance of gender bias in grading (i.e the gender difference 
in standardized test score and unblind grades given by teachers) in affecting performance in 


math and university choice (Lavy and Megalokonomou, 2017; Lavy and Sand, 2015; Terrier, 
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2015). A natural question is whether IAT score affects bias in grading of teachers. I have 
information only on grades given by teachers at the end of the semester. As shown in Table 
A.XVI, girls on average get a higher grade compared to boys with the same standardized test 
score in math*!. Females assigned to teachers with more stereotypes get a slightly lower grade, 
but the effect is indistinguishable from zero. However, it should be considered that grades are 
categorical variable from 2 to 10, where 6 is the pass grade. As it can be clearly seen by Figure 
Ai] in the Online Appendix, there is a high bunching at the pass grade and almost half of 
the students obtain the same grade in math. Hence, there is little variability on this variable 
especially at the bottom of the distribution. 


Retention rate 


In the Italian schooling system, at the end of each academic year, teachers decide whether the 
student is admitted to the following grade. This decision is based on the overall assessment 
of students, including both performance and behavior in class. The retention rate of males is 
higher compared to the one of females. For instance, among students who attended the test 
score in grade 6 (9837 students), 6.0% of males and 3.3% of females are retained in (at least) 
one of the three years of middle school. In Table A.X VII, I check whether math teachers’ bias 
has an impact on retention rate, but I do not find any significant impact, neither without nor with 
the inclusion of the controls at teacher and student level. Furthermore, I also check that teacher 
implicit stereotypes does not differentially impact the probability of taking the standardized test 
score in grade 8 (Table A.X VII, columns 5-8), conditional on taking the one in grade 6. These 
results suggest that the sample used in our main table on performance in math is not biased by 
differential attrition by gender, induced by teacher bias. Additional checks on potential sample 


selection issues are addressed in the Online Appendix D. 


Explicit Bias 


In Table A.X VIII of the Online Appendix, I consider the impact of self-reported gender differ- 
enced in innate math abilities on student outcomes. I find that it has an impact indistinguishable 
from zero, although the effect of more conservative gender norm on girls’ performance is nega- 
tive, in the same direction as the results reported by Alan et al. (2018). However, unfortunately 
the proxies available of gender norms are few and with little variation.” Finally, the impact of 





4lHere, the results are shown controlling for the standardized test score in grade 6, but the magnitude and 
significance is very close when controlling for the standardized test score in grade 8. 

41 collected also more detailed information on the reasons behind the gender gap in track choice between girls 
and boys who are more clearly correlated with the performance of students by gender, but in this context may be 
affected by reverse causality. 
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IAT score on student achievement is not significantly affected when I control for reported bias. 
This evidence seems to support the distinctiveness of implicit and explicit cognition (Greenwald 


et al., 1998) in the context of gender stereotypes of teacher. 


6 Discussion of Potential Mechanisms 


In this section, I discuss the mechanisms behind the negative impact of teacher bias on student 
achievement. I focus mainly on two aspects: self-confidence and interaction theory.** I use 
student survey data to deeply analyze the former aspect, while for the latter I rely mainly on 
the social psychology evidence on the interaction between teachers and pupils by gender. in the 


Online Appendix E, I present a conceptual framework including both aspects. 


Self-Confidence 


Self-confidence plays a crucial role in affecting performance, especially in gender-incongruent 
areas, such as female performance in math (Coffman, 2014). According with social psychology, 
the development of academic self-concept begins since childhood and is strongly influenced in 
the period after elementary school by stereotypes communicated by significant others, such 
as parents and teachers (Ertl et al., 2017). Girls may believe that both own signal of ability 
and the signal received by teachers carry relevant information. However, if the signal received 
from teachers is biased by beliefs that women have lower ability than men in math or are less 
suitable for a STEM career, females may develop a lower self-assessment of own ability in the 
scientific field and potentially invest less in their STEM education. The idea is consistent with 
the stereotype threat theory developed in social psychological literature (Steele and Aronson, 
1995), according to which individuals at risk of confirming widely-known negative stereotypes 
reduce their confidence and underperform in fields in which their group is ability- stigmatized 
(Spencer et al., 1999),*4 


Table IX assesses the extent to which teachers’ stereotypes affect one’s own assessment of 





43There is a third theory that could be consistent with the negative impact of teacher bias on female student 
math performance. According with the animus theory, teachers may dislike female students, treating them badly 
or giving them more unpleasant assignments, causing girls to dislike math. In our context, it seems unlikely that 
teachers assign different tasks to students by gender in terms of exams or homework. Furthermore, as shown in 
Section 5.3, teachers tend to favour female students in math grading, compared to blinded scores, as emerges in 
several other countries (Lavy and Sand, 2015; Terrier, 2015). 

“4Nespite the rich literature in social psychology about stereotype threat since 1990s, only recently economists 
have directly analyzed this phenomenon, finding partially contradictory evidence. One of the first steps taken in 
this direction has been Fryer et al. (2008), which finds no evidence of stereotype threat behavior in influencing 
women’s performance in math, while Dee (2014) shows a substantial impact of activating negatively stereotyped 
identity (i.e., student-athlete) on test score performance. 
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ability, for a sample of around 800 students for whom I collected self-confidence measures.*° I 


present results for self-stereotypes in math in Panel A, in reading in Panel B and on average of 
all other subjects in Panel C. As shown in column 1, girls are 9.4 percentage points less likely 
to consider themselves good at math (which corresponds to 11% percent lower probability than 
males). Female students are generally found to be more critical about their abilities in math than 
male students even if they have the same grade, as shown in PISA tests as well (OECD, 2015). 
However, females are 5.2 percentage points more likely to consider themselves good in Italian 
(which corresponds to 6% percent higher probability than males), but on average both equally 
assess their own ability. In classes assigned to math teachers with higher bias, the gender gap in 
self-assessment of own ability in math increases. In particular, in classes assigned to teachers 
with one standard deviation higher IAT score, the gender gap in self-assessment increases by 
4.9 percentage points, controlling for the test score in grade 6 as in our main specification in 
equation (1). Adding student and teacher level controls interacted with pupil gender do not 
substantially affect the point estimate of interest (columns 3 and 4). 

In Section 5.1, I provide evidence that the gender gap in math achievement increases during 
middle school in classes assigned to a more biased teacher. Hence, in the last three columns of 
Table IX, I also control for the mediating role of performance measured at the end of middle 
school in order to analyze whether gender gap in own assessment is merely due to different 
performance in grade 8. I find that gap in own assessment is reduced by one third. However, 
teachers’ stereotypes seems to have an additional impact on own assessment of math abilities, on 
top of performance in standardized test score, that may have detrimental effects for investment 
choices in education and occupation. 

In Panel B and C of Tables [IX , I focus on the impact of teachers’ steterotypes on self- 
assessment respectively in Italian and all other subjects. Although girls do not improve their 
performance in reading when assigned to teachers with higher IAT score as shown in the Online 
Appendix Table A. XIV, they have higher self-confidence in Italian. One potential explanation 
is related to the framing of the question: students are asked to report whether they believe they 
are “good”, “mediocre” or “bad” at each subject. They may want to avoid saying that they 
are “bad” at both the two crucial subjects (math and literature) and compensate the low self- 
confidence in math with higher self-assessment in Italian. There is no impact on other subjects. 
The effects are robust to the inclusion of controls at individual level (column 3 and 4), at teacher 
level (column 4), and also to the inclusion of standardized test score in grade 8. 


This result is important for at least two reasons. First, it shows that self-confidence of women 





‘SThis measure of self-confidence is correlated with future educational choices of individuals. For instance, 
students with higher self-confidence in math are more likely to attend the scientific track, even controlling for 
standardized test score at the end of middle school. Table available upon request to the author. 
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in math is affected by social conditioning from teachers. Second, this is an important mecha- 
nism to understand the effect of teachers’ stereotypes on math performance and track choice of 


female students. 


Interaction Theory 


A second potential mechanism is related to the interaction theory (McConnell and Leibold, 
2001): teachers of scientific subjects with stronger implicit association between math and boys 
may spend less time (in terms of either quantity or quality of time) interacting with girls, es- 
pecially those performing poorly. Indeed, teachers may choose to allocate more time or tailor 
math classes to the learning of boys and top-performing girls since they are more likely to attend 
a STEM track during high school. If so, I would expect higher achievement of these groups of 
students when exposed to a gender-biased teachers. However, as shown in Table VII and VI, I 
find that these groups of students are not affected by teacher stereotypes. 

Unfortunately, I do not have measures of the “quality of interaction” between teachers and 
student by gender to directly test the interaction theory. However, the social psychology liter- 
ature provides evidence that math teachers interact differently with male and female students. 
It has been shown that they believe math is more difficult for girls than equally achieving boys 
(Riegle-Crumb and Humphries, 2012; Tiedemann, 2002).*° Furthermore, Sadker and Sadker 
(2010) document that teachers spend more time interacting with boys, while Hyde et al. (1990) 
suggests that math is taught as a set of computational methods to girls, while boys are encour- 
aged to exert independence. Finally, Keller (2001) find that teachers convey their stereotyping 
of mathematics as a male domain through their classroom instruction and affect students’ own 
association between math and gender. 

Teachers’ erroneous expectations may lead to a self-fulfilling prophecy: biased teachers may 
set a lower bar for the learning of students from sigmatized groups and they may fail to rec- 
ognize their talent and not encourage them to fulfil their potential (Rosenthal and Jacobson, 
1968; Cooper and Good, 1983). This is coherent with the evidence presented in Table VIII on 
biased track recommendation to girls from teachers with stronger associations math-males and 
literature-females. Gender-biased interaction between pupils and teachers may be an impor- 
tant mechanisms behind the main results of this paper but further research is necessary to fully 
address this aspect. 





46Using Italian data from INVALSI, I show in the Online Appendix F that this perception of teachers mirrors 
a self-perception of students. Female students compared to boys with the same performance are more likely to 
believe their achievement is the result of effort and less likely to believe it is the result of ability. 
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7 Conclusion 


In most OECD countries, women outnumber men in tertiary education, but they are by far a 
minority in highly paid fields such as science, technology, engineering and math, especially 
when excluding teaching careers. The prospects for change are not optimistic considering that 
on average in OECD countries less than 5 percent of 15-years-old girls are planning to pursue a 
career in these fields compared to around 20 percent of boys according to 2015 PISA data. Cul- 
ture and social conditioning have a strong impact on the development of skills and educational 
choices. This paper shows that one third of the gender gap in math performance created during 
middle school can be explained by teacher implicit stereotypes. Girls, especially those from 
disadvantaged backgrounds, are lagging behind when assigned to teachers with stronger im- 
plicit associations math-males and literature-females. Boys, the group not ability- stigmatized 
in terms of math performance, are not affected by teacher bias. Teachers’ stereotypes influ- 
ence high-school track choice, inducing more female students to attend a vocational school. 
Furthermore, they foster low expectations about own ability and lead to underperformance in 
male-typed domains. Indeed, girls are more likely to consider themselves bad in math at the 
end of middle school if they are assigned to a biased teacher, even controlling for their abil- 
ity measured by standardized test scores. These findings are consistent with a model whereby 
ability-stigmatized groups under-assess own ability and underperform fulfilling negative expec- 
tations about their achievements. Implicit associations can form an unintended and often an 
invisible barrier to equal opportunity. 

These results raise the question of which kind of policies should be implemented in order 
to alleviate the impact of gender stereotypes. The implicit bias measured by IAT score at this 
stage of development should not be used to make decisions about others, as hiring or firing 
decisions. IAT scores are educational tools to develop awareness of implicit preferences and 
stereotypes. Hence, one set of potential policies may be aimed at informing people about own 
bias or training them in order to assure equal behavior toward all individuals, especially within 
the schooling context (Alesina et al., 2018). An alternative way to fight against the negative 
consequences of stereotypes is increasing self-confidence of girls in math or providing alterna- 
tive role models — as done in the context of Indian elections, where exposure to female leaders 
weakens gender stereotypes in the home and public spheres (Beaman et al., 2009), or in French 
schools, by offering alternative role models (Breda et al., 2018). More research is needed to 


further investigate the impact of both type of policies. 
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Figures and Tables 


Figure I. Teachers’ Implicit Gender Bias (IAT measure) by gender and subject they teach 
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Notes: This graph shows the distribution of Gender-Science IAT scores for math and literature teachers, 
separated by gender. A higher value of implicit bias indicates a stronger association between scientific- 
males and humanistic-females. Zero indicates no gender stereotypes. The graph provides evidence that 
teachers in gender-incompatible fields have stereotypes closer to zero. 
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Figure II. Timeline of main data available for students and teachers 
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Notes: This graph shows the timeline of data collected for the three cohorts of students. They graduated 
from middle school between 2013 and 2015. Teachers were surveyed between October 2016 and March 


2017. Standardized test scores are administered at the end of grade 6 and 8. 
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Figure III. Effect of teacher bias on student math performance 
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Notes: This graph shows the effect of teachers’ stereotypes on student achievement. We consider the 
thresholds defined by Greenwald et al. (2003) where no bias is the interval of IAT raw score between 
-0.15 and +0.15. The attitude of the teacher in associating fields with gender is a stronger association 
“girls-math” if the score is lower than -0.15 (25% of teachers) and “boys-math” if the score is higher than 
+0.15 (45% of teachers). The variable in the y axis is the gender gap in improvements in math between 
grade 6 and 8, when class fixed effects are absorbed. 
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Figure IV. Effect of teacher bias on student math performance by gender 
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Notes: This graph shows the effect of teachers’ stereotypes on student achievement by gender. The 
variable in the y axis is the residualized standardized test score in grade 8, after controlling for school by 
cohort fixed effects, student and teacher level controls. The variable in the x axis is the raw IAT score. 
A higher value of implicit bias indicates a stronger association between scientific-males and humanistic- 
females. 
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Figure V. Effect of teacher bias on choice of vocational track of females 
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Notes: This graph shows the effect of teachers’ stereotypes on female students’ track choice. We consider 
the thresholds defined by Greenwald et al. (2003) where no bias is the interval of [AT raw score between 
-0.15 and +0.15. The attitude of the teacher in associating fields with gender is a stronger association 
“girls-math” if the score is lower than -0.15 (25% of teachers) and “boys-math” if the score is higher than 
+0.15 (45% of teachers). The variable in the y axis is the gender gap in improvements in math between 
grade 6 and 8, when class fixed effects are absorbed. 
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Table I. Summary Statistics from Math Teachers’ Questionnaire 








Count Mean SD Min Max 





Family and education 








Female 303 0.84 0.37 0.00 ~~ 1.00 
Born in the North 293 0.66 0.48 0.00 ~~ 1.00 
Age 292 51.86 8.39 31.00 66.00 
Children 303 0.75 0.44 0.00 1.00 
Number of children 217 1.84 0.80 0.00 5.00 
Number of daughters 217 0.86 0.77. 0.00 3.00 
Low edu Mother 280 0.58 0.49 0.00 = 1.00 
Middle edu Mother 280 0.29 0.46 0.00 = 1.00 
High edu Mother 280 0.12 0.33 0.00 1.00 
Advanced STEM 294 0.24 043 0.00 ~~ 1.00 
Degree Laude 258 0.17 0.38 0.00 1.00 
Job characteristics 

Full time contract 287 0.91 0.28 0.00 1.00 
Years of experience 287 22.94 10.79 3.00 48.00 
Math Olympiad 294 0.19 0.40 0.00 ~ 1.00 
Update Courses 294 0.94 0.24 0.00 ~ 1.00 
Satisfy with teacher job 290 3.70 0.84 2.00 5.00 
Implicit bias 

IAT Gender 303 0.09 0.37 -1.03 1.08 
Self-reported explicit bias 

WVS Gender Equality 292 0.16 0.37. 0.00 1.00 
Gender Dif Innate Ability 283 1.51 0.75 1.00 3.00 
Reason GenderGap: Interest for STEM 258 2.57 0.98 1.00 4.00 
Reason GenderGap: Predisposition for STEM 242 2.11 1.04 1.00 5.00 
Reason GenderGap: Low self-esteem 279 2.64 1.05 1.00 5.00 
Reason GenderGap: Family support 280 3.14 1.09 1.00 5.00 
Reason GenderGap: Cultural Stereotypes 281 2.15 1.16 1.00 5.00 
Boys better in Invalsi 235 0.20 0.40 0.00 1.00 
Girls better in Invalsi 235 0.32 0.47 0.00 = 1.00 
Gender Equal in Invalsi 235 0.48 0.50 0.00 1.00 
Observations 303 








Notes: First-hand data from teachers’ questionnaire. We restrict the sample to teachers 
matched to students and therefore used in the main analysis of this paper. The balance ta- 
ble with the difference between teachers’ matched and not matched with students’ data is pre- 
sented in Table A.III. The main reason for not matching teachers with students is that they 
were not teaching in the school before 2016. 
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Table II. Summary Statistics of students by gender 

















Males Females Diff. se 
Baseline characteristics 
Std Math grade 6 0.230 0.036 0.194*** (0.020) 
Std Ita grade 6 0.084 0.214 -0.130*** (0.019) 
Born in the North 0.848 0.854 -0.006 (0.007) 
Born in the Center/South 0.028 0.030 -0.003 (0.003) 
Immigrant 0.189 0.174 0.015 (0.008) 
Second Gen. Immigrant 0.080 0.075 0.005 (0.006) 
HighEduMother 0.452 0.448 0.004 (0.010) 
High Occupation Father 0.166 0.171 -0.005 (0.008) 
Medium Occupation Father 0.321 0.303 0.018 (0.010) 
Outcomes 
Std Math grade 8 0.190 -0.024 0.214*** (0.020) 
Std Ita grade 8 -0.010 0.170 -0.180*** (0.020) 
High-school Track: Scientific 0.305 0.208 0.097*** ~~ (0.010) 
High-school Track: Classic 0.042 0.079 -0.036*** (0.005) 
High-school Track: Other Academic 0.096 0.335 -0.239*** (0.009) 
High-school Track: Technical Technological 0.311 0.067 0.244*** — (0.008) 
High-school Track: Technical Economic 0.113 0.163 -0.050*** (0.008) 
High-school Track: Vocational 0.133 0.148 -0.015* (0.008) 
Track recommendation: Scientific 0.165 0.110 0.055*** (0.008) 
Track recommendation: Vocational 0.363 0.299 0.063*** (0.011) 
Own ability: all subjects 0.656 0.646 0.010 (0.012) 
Own ability: math 0.833 0.747 0.087** — (0.030) 
Own ability: Italian 0.917 0.968 -0.051** (0.018) 
Observations 4684 4599 





Notes: This table reports the summary statistics and the difference between the two genders 
in outcomes and baseline characteristics. *, ** and *** indicate significance at the 10%, 5% 
and 1% percent level respectively. 
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Table III. Correlation between teachers’ characteristics and Gender IAT Score 


Panel A: Independent variables (background teachers’ characteristics) 











Female Age HighMotherEdu — Children Daughters 
() (2) (3) (4) (5) 
Dep. Var.: 
Raw IAT -0.190** 0.009 -0.059 0.061 0.050 
(0.082) (0.059) (0.057) (0.146) (0.076) 
Obs. 303 303 303 303 303 
R? 0.344 0.325 0.336 0.328 0.329 





Panel B: Independent variables (cultural traits and beliefs) 
BornNorth WomenLFP WVSCityBorn WVSIndiv  InnateAbility 











(1) (2) (3) (4) (5) 
Dep. Var.: 
Raw IAT -0.159** -0.512** 0.391* 0.014 0.014 
(0.064) (0.244) (0.211) (0.086) (0.041) 
Obs. 303 288 263 303 303 
R? 0.347 0.360 0.400 0.323 0.324 





Panel C: Independent variables (education and teacher experience) 











Ad.STEM Laude FullContract Olympiad JobSatisfy 
(1) (2) (3) (4) (5) 

Dep. Var.: 
Raw IAT -0.094 -0.033 -0.058 0.051 0.055* 

(0.076) (0.074) (0.145) (0.086) (0.032) 
Obs. 303 303 303 303 303 
R? 0.330 0.324 0.325 0.308 0.334 
School FE Yes Yes Yes Yes Yes 


Notes: This table reports OLS estimates of the correlation between math teachers’ stereo- 
types measured by IAT score and own teacher characteristics; the unit of observation is 
teacher ¢ in school s. Standard errors are robust and clustered at school level in parentheses; 
the number of clusters is 90. School fixed effects are included in all regressions. The signif- 
icance and magnitude of coefficients are not significantly impacted by the inclusion of FE. 
The variable “Female” indicates the gender of the teacher, “Born in the North ” assumes 
value | if the teacher was born in the North of Italy, “HighMotherEdu” is a dummy which 
assumes value | is the mother of the teacher has at least a diploma,“Children” and “Daugh- 
ters” are dummies which assumes value | if the teacher has children/daughters. The variable 
“Ad.STEM” assumes value | if the teacher has a degree in math, engineering and physics, 
“Laude” is a dummy which assumes value | if the degree was achieved with laude, “Full 
Contract” assumes value | is the teacher has tenure, “Olympiad” is 1 for teachers in charge 
of math Olympiad in the school, “JobSatisfy” is a categorical variable from 1 to 5 which 
captures self-reported job satisfaction of teachers, “Updates” captures whether teachers fol- 
lowed update courses in teaching during the academic year, ““WomenLFP” is the labor force 
participation of women in the province of birth, “WVSCityBorn” is the WVS answer to the 
relative rights of men and women to paid jobs when the latter are scarce, “WVSIndiv” is the 
answer to the same question at individual level, “InnateAbility” regards the teacher belief 
about innate differences in math abilities between men and women. We include the order of 
IATs for math teachers and missing categories if the information is not available. *, ** and 
*“* indicate significance at the 10%, 5% and 1% percent level respectively. 
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Table IV. Exogeneity of assignment of students to math teachers with different bias 


Dependent Variable: implicit gender stereotypes of math teacher (standardized) 


(1) (2) (3) (4) (5) (6) (7) 











Fem 0.006 -0.009 0.003 0.011 0.008 -0.026 0.301 
(0.012) (0.020) (0.024) (0.015) (0.012) (0.122) (0.230) 
Fem*HighEduMother 0.028 0.031 0.003 
(0.031) (0.029) (0.039) 
HighEduMother 0.021 0.012  -0.009 
(0.025) (0.023) (0.026) 
Fem*Medium Occupation Father 0.020 0.015 0.077 
(0.033) (0.030) (0.050) 
Medium Occupation Father 0.009 -0.001 0.005 
(0.022) (0.021) (0.028) 
Fem*High Occupation Father -0.007 -0.013 —-0.026 
(0.037) (0.034) (0.054) 
High Occupation Father 0.024 0.019 0.004 
(0.028) (0.025) (0.034) 
Fem*Immigrant -0.027 0.017 0.040 
(0.036) (0.037) (0.069) 
Immigrant 0.056** 0.040 0.076 
(0.027) (0.027) (0.048) 
Fem* Std Ita grade 6 -0.001 -0.001 -0.011 
(0.014) (0.013) (0.020) 
Std Ita grade 6 -0.009 -0.013 -0.014 
(0.012) (0.012) (0.014) 
Fem* Std Mat grade 5 0.002 
(0.021) 
Std Mat grade 6 0.002 
(0.013) 
School, year FE Yes Yes Yes Yes Yes Yes Yes 
Teacher Control No No No No No Yes Yes 

Obs. 9282 9282 9282 9282 9253 9253 1647 
R? 0.409 0.409 0.409 0.409 0.416 0.491 0.765 





Notes: This table reports OLS estimates of the correlation between math teachers’ stereotypes measured by 
IAT score and students’ characteristics; the unit of observation is student i, in class c taught by teacher f in 
grade 8 of school s. Standard errors are robust and clustered at math teacher level in parentheses; the number 
of clusters is 303 in columns 1-6 and 131 in column 7. The variable “Fem” indicates the gender of the stu- 
dent, “HighEduMother” assumes value | if the mother has at least a 5 years diploma, “Medium Occupation 
Father” assumes value | if the father is a teacher or office worker, while “High Occupation Father” is 1 if the 
father is manager, university professor or an executive. “Immigrant” assumes value | is the student is not an 
Italian citizen, while “Std Mat grade 5” and “Std Ita grade 6” are the standardized test score in grade 5 in math 
and grade 6 in Italian respectively. Teacher controls include teacher gender, place of birth, advanced STEM 
degree (as physics, math, engineering), leader of school math Olympics, degree with honour, update courses, 
age, type of contract, education of the teacher’ mother and self-reported gender bias and their interactions 
with students’ gender. All regression include controls for the order of IAT in the questionnaire administered. 
For 29 students we do not observe the test score in Italian in grade 6. The last column has a lower number 
of observations since the test score in grade 5 is available only for part of the sample. *, ** and *** indicate 
significance at the 10%, 5% and 1% percent level respectively. 
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Table V. Estimation of the effect of teachers’ gender stereotypes on math standardized test score 
in grade 8 - class FE regression 


Dependent Variable: Math standardized test score in grade 8 











(1) (2) (3) (4) (5) 
Fem -0.221*** -0.077*** —_ -0.090*** -0.044 -0.038 
(0.019) (0.014) (0.015) (0.033) (0.103) 
Fem*Teachers’ Stereotypes -0.030** = -0.031** = -0.042*** 
(0.014) (0.014) (0.015) 
Fem* Teacher Fem -0.055 
(0.037) 
Fem*North Teacher 0.011 
(0.030) 
Fem* Advanced STEM Teacher -0.044 
(0.031) 
Std Math grade 6 0.724***  —0.724*** ~—-0.699*** ~——-0..700*** 
(0.012) (0.012) (0.013) (0.013) 
Gender Gap -0.221 -0.077 -0.077 -0.081 -0.081 
Class FE Yes Yes Yes Yes Yes 
Student Controls No No No Yes Yes 
Teacher Controls No No No No Yes 
Obs. 9282 9282 9282 9282 9282 
R2 0.209 0.619 0.619 0.626 0.626 





Notes: This table reports OLS estimates of equation 1, where the dependent variable is math 
standardized test score in grade 8; the unit of observation is student i, in class c taught by teacher 
t in grade 8 of school s. Standard errors are robust and clustered at math teacher level in paren- 
theses; the number of clusters is 303. The number of fixed effects (classes) is 552. The variable 
“Fem” indicates the gender of the student. Individual controls include education of the mother, 
occupation of the father, immigrant dummy, generation of immigration and their interactions 
with the gender of the student. All columns include the interaction between math standardized 
test score in grade 6 and students’ gender. Teacher controls include the interaction between stu- 
dents’ gender and teacher gender, place of birth, age, children and daughters, advanced STEM 
degree (as physics, math, engineering), leader of school math Olympics, degree with honour, 
update courses, type of contract and education of the teacher’ mother. *, ** and *** indicate sig- 
nificance at the 10%, 5% and 1% percent level respectively. 
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Table VI. Estimation of the effect of teachers’ gender stereotypes on math standardized test 
score in grade 8 - school FE regression 


Dependent Variable: Math standardized test score in grade 8 

















(1) (2) (3) (4) (5) 
Fem -0.218*** = -0.077*** —_-0.087*** -0.027 -0.040 
(0.019) (0.014) (0.015) (0.033) (0.107) 
Teachers’ Stereotypes -0.013 -0.013 -0.006 
(0.016) (0.015) (0.015) 
Fem*Teachers’ Stereotypes -0.024* -0.026* = -0.034** 
(0.014) (0.014) (0.014) 
Math Teacher Fem 0.067* 
(0.040) 
Fem*Math Teacher Fem -0.050 
(0.040) 
Math Teacher born North 0.031 
(0.035) 
Fem*North Math Teacher 0.014 
(0.031) 
Advanced STEM 0.028 
(0.034) 
Fem* Advanced STEM Teacher -0.032 
(0.034) 
Std Math grade 6 0.717***  ~=0.716*** ~—0.689*** ~—-0.689*** 
(0.011) (0.011) (0.012) (0.012) 
Gender Gap -0.218 -0.072 -0.072 -0.082 -0.083 
School, year FE Yes Yes Yes Yes Yes 
Student Controls No No No Yes Yes 
Teacher Controls No No No No Yes 
Obs. 9282 9282 9282 9282 9282 
R2 0.135 0.575 0.576 0.584 0.588 


Notes: This table reports OLS estimates of equation 2, where the dependent variable is math 
standardized test score in grade 8; the unit of observation is student i, in class c taught by 
teacher ¢ in grade 8 of school s. Standard errors are robust and clustered at math teacher level 
in parentheses; the number of clusters is 303. The number of fixed effects (school by cohort) is 
185. The variable “Fem” indicates the gender of the student. Individual controls include educa- 
tion of the mother, occupation of the father, immigrant dummy, generation of immigration and 
their interactions with the gender of the student. All columns include the interaction between 
math standardized test score in grade 6 and students’ gender. Teacher controls include teacher 
gender, place of birth, children and daughters, advanced STEM degree (as physics, math, en- 
gineering), leader of school math Olympics, degree with honour, update courses, age, type of 
contract, education of the teacher’ mother and the interaction with students’ gender of all these 
characteristics. We include a control for whether the class has an extended school day and the 
interaction with the gender of students. *, ** and *** indicate significance at the 10%, 5% and 
1% percent level respectively. 
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Table VII. Estimation of the effect of teachers’ gender stereotypes 


Dependent Variable: Math standardized test score in grade 8 











Heterogeneous effects by Student Characteristics Interaction time 
with teacher 
(1) (2) (3) (4) (5) (6) 
Fem -0.038 -0.037 0.001 -0.036 -0.064 -0.056 


(0.103) (0.103) (0.113) (0.103) (0.103) = (0.106) 
Fem*Teachers’ Stereotypes(IAT) -0.042*** = -0.057*** — -0.078*** = -0.040** = -0.047*** —_ -0.065* 
(0.015) (0.022) (0.029) (0.017) (0.018) (0.034) 








Fem*IAT*HighEduM 0.028 
(0.030) 
Fem*IAT*Top tercile Math6 0.112*** 
(0.039) 
Fem*IAT*Middle tercile Math6 0.009 
(0.038) 
Fem*IAT*Immigrant -0.014 
(0.042) 
Fem*IAT*Extended School Day 0.023 
(0.028) 
Fem*IAT*Same Math Teacher 0.028 
(0.038) 
Gender Gap -0.082 -0.082 -0.082 -0.082 -0.082 -0.082 
Class FE Yes Yes Yes Yes Yes Yes 
Student Controls Yes Yes Yes Yes Yes Yes 
Teacher Controls Yes Yes Yes Yes Yes Yes 
Obs. 9282 9282 9282 9282 9282 9282 
R? 0.626 0.627 0.628 0.627 0.627 0.627 


Notes: This table reports OLS estimates of the heterogeneous impact of math teachers’ gender stereotypes 
measured by IAT score on math standardized test score in grade 8 by observable characteristics of the stu- 
dent and by interaction time with teacher; the unit of observation is student i, in class c taught by teacher ¢ in 
grade 8 of school s. Standard errors are robust and clustered at math teacher level in parentheses; the num- 
ber of clusters is 303. The number of fixed effects (classes) is 552. The variable “Fem” indicates the gender 
of the student, “HighEduM” wether the mother has at least a diploma, “tercile Math6” is the tercile of stan- 
dardized test score in math in grade 6 and “Immigrant” is a dummy equal to | if the student is not Italian 
citizen. Individual controls include education of the mother, occupation of the father, immigrant dummy, 
generation of immigration and their interactions with the gender of the student. All columns include the in- 
teraction between math standardized test score in grade 6 and students’ gender. Teacher controls include 
the interaction between students’ gender and teacher gender, place of birth, age, children and daughters, ad- 
vanced STEM degree (as physics, math, engineering), leader of school math Olympics, degree with honour, 
update courses, type of contract and education of the teacher’ mother. Regressions are all fully saturated 
even if not all interactions are shown in the table. *, ** and *** indicate significance at the 10%, 5% and 1% 
percent level respectively. 
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Table VIII. Estimation of the effect of teachers’ gender bias on track choice- class FE 























(1) (2) (3) (4) (5) (6) (7) (8) 
Panel A- Dependent Variable: High-School Track Choice 
Scientific Academic Vocational 
Female -0.095*** — -0.044*** 0.030 0.166* 0.014 0.001 0.028 0.053 
(0.012) (0.012) (0.020) (0.092) (0.009) (0.011) (0.023) (0.073) 
Fem*Stereotypes 0.011 0.010 0.003 0.025** 0.022** 0.024** 
(0.013) (0.012) (0.012) (0.010) (0.010) (0.010) 
Fem* Teacher Fem -0.033 0.037* 
(0.029) (0.022) 
Std Math grade 6 0.178*** = 0.159*** —-0.160*** -0.104***  -0.091*** = -0.091*** 
(0.008) (0.008) (0.008) (0.007) (0.007) (0.007) 
Constant 0.300*** = 0.243*** =0.107*** 0.109*** =—0.141*** —s0.174*** ——0.206*** ~—:0..205*** 
(0.006) (0.006) (0.015) (0.015) (0.005) (0.006) (0.016) (0.016) 
MeanYFem 0.205 0.205 0.205 0.205 0.155 0.155 0.155 0.155 
Obs. 8451 8451 8451 8451 8451 8451 8451 8451 
R2 0.115 0.216 0.236 0.238 0.119 0.190 0.208 0.211 
Panel B- Dependent Variable: Teachers’ Recommendation 
Scientific Academic Vocational 
Fem -0.045*** = -0.018* 0.034** 0.028 -0.059*** = -0.103*** = -0.116*** -0.031 
(0.010) (0.010) (0.016) (0.081) (0.013) (0.012) (0.024) (0.091) 
Fem*Stereotypes 0.004 0.003 -0.005 0.018 0.018 0.024** 
(0.010) (0.010) (0.010) (0.011) (0.011) (0.012) 
Fem* Teacher Fem -0.054** 0.025 
(0.025) (0.037) 
Std Math grade 6 0.127" = =0.113*** 0.114" -0.245*** = -0.216*** = -0.217*** 
(0.009) (0.009) (0.009) (0.008) (0.008) (0.008) 
Constant 0.156*** = 0.130*** ~—-0.060*** 0.096 0.377*** = 0.429*** = -0.515*** —-2..344*** 
(0.005) (0.004) (0.011) (0.134) (0.006) (0.006) (0.017) (0.117) 
MeanYFem 0.109 0.109 0.109 0.109 0.318 0.318 0.318 0.318 
Obs. 7084 7084 7084 7084 7084 7084 7084 7084 
R? 0.156 0.243 0.253 0.255 0.148 0.360 0.387 0.388 
Class FE Yes Yes Yes Yes Yes Yes Yes Yes 
S Controls No No Yes Yes No No Yes Yes 
T Controls No No No Yes No No No Yes 





Notes: This table reports OLS estimates of equation 1, where the dependent variable is the high-school track choice; the 
unit of observation is student i, in class c taught by teacher ¢ in grade 8 of school s. Standard errors are robust and clus- 
tered at math teacher level in parentheses; the number of clusters is 303. The number of fixed effects (classes) is 551. 
The variable “Fem” indicates the gender of the student and “Stereotypes” is the IAT score of the teacher. Individual con- 
trols include education of the mother, occupation of the father, immigrant dummy, generation of immigration and their 
interactions with the gender of the student. All columns include the interaction between math standardized test score 
in grade 6 and students’ gender. Teacher controls include the interaction between students’ gender and teacher gender, 
place of birth, children and daughters, advanced STEM degree (as physics, math, engineering), leader of school math 
Olympics, degree with honour, update courses, age, type of contract and education of the teacher’ mother. *, ** and *** 
indicate significance at the 10%, 5% and 1% percent level respectively. 
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Table IX. Estimation of the effect of teachers’ gender bias on self-confidence- class FE 























(1) (2) (3) (4) (5) (6) (7) 
Panel A- Dependent Variable: Being good/mediocre at math (vs. being bad) 
Fem -0.094*** -0.086*** = -0.114* 0.147 -0.066** -0.088 0.174 
(0.029) (0.029) (0.066) (0.189) (0.028) (0.066) (0.200) 
Fem*Teacher Stereotypes -0.049** = -0.053** —-0.072** — -0.032 -0.036 -0.056* 
(0.023) (0.024) (0.033) (0.023) (0.025) (0.032) 
Constant 0.837*** —0.808***  0.809*** 0.800*** 0.810*** 0.820*** 0.812*** 
(0.015) (0.015) (0.048) (0.047) (0.015) (0.048) (0.046) 
Std Test score math No Grade 6 Grade6 Grade6 Grade8 Grade8 Grade 8 
Obs. T47 TA47 TAT TA47 T47 TAT T47 
R2 0.110 0.216 0.236 0.253 0.248 0.266 0.281 
Panel B- Dependent Variable: Being good/mediocre at Italian (vs. being bad) 
Fem 0.052** 0.072*** 0.061 0.177 0.064*** 0.051 0.148 
(0.023) (0.025) (0.047) (0.214) (0.023) (0.045) (0.202) 
Fem*Teacher Stereotypes 0.041** = -0.041** 0.029 0.042** = 0.042** 0.032 
(0.019) (0.020) (0.024) (0.018) (0.020) (0.023) 
Constant 0.916*** 0.908*** 0.937*** 0.946*** 0.917*** 0.953*** 0.963*** 
(0.012) (0.012) (0.034) (0.035) (0.011) (0.034) (0.035) 
Std Test score Italian No Grade6  Grade6 Grade6 Grade8 Grade8 Grade 8 
Obs. 664 664 664 664 664 664 664 
R2 0.115 0.134 0.148 0.175 0.148 0.161 0.189 
Panel C- Dependent Variable: Average own ability in other subjects 
Fem 0.035 0.013 0.015 -0.222 0.008 0.009 -0.230 
(0.027) (0.028) (0.061) (0.222) (0.027) (0.061) (0.223) 
Fem*Teacher Stereotypes -0.015 -0.016 -0.022 -0.020 -0.021 -0.027 
(0.025) (0.026) (0.029) (0.026) (0.026) (0.029) 
Std Test score math No Grade 6 Grade6 Grade6 Grade8 Grade&8 Grade 8 
Obs. 802 802 802 802 802 802 802 
R?2 0.096 0.125 0.137 0.157 0.130 0.141 0.161 
Class FE Yes Yes Yes Yes Yes Yes Yes 
Student Controls No No Yes Yes No Yes Yes 
Math Teacher Controls No No No Yes No No Yes 





Notes: This table reports OLS estimates of equation |, where the dependent variable is self-stereotypes in grade 
8; the unit of observation is student i, in class c taught by teacher f in grade 8 of school s. Standard errors are 
robust and clustered at math teacher level in parentheses; the number of clusters for math is 58. The number 
of fixed effects (classes) is 62. The variable “Fem” indicates the gender of the student. Individual controls in- 
clude education of the mother, occupation of the father, immigrant dummy, generation of immigration and their 
interactions with the gender of the student. All columns include the interaction between math standardized test 
score in grade 6 and students’ gender. Teacher controls include the interaction between students’ gender and 
teacher gender, place of birth, children and daughters, advanced STEM degree (as physics, math, engineering), 
leader of school math Olympics, degree with honour, update courses, age, type of contract and education of the 
teacher’ mother. *, ** and *** indicate significance at the 10%, 5% and 1% percent level respectively. 
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